16 research outputs found

    Mining Predictive Patterns and Extension to Multivariate Temporal Data

    Get PDF
    An important goal of knowledge discovery is the search for patterns in the data that can help explaining its underlying structure. To be practically useful, the discovered patterns should be novel (unexpected) and easy to understand by humans. In this thesis, we study the problem of mining patterns (defining subpopulations of data instances) that are important for predicting and explaining a specific outcome variable. An example is the task of identifying groups of patients that respond better to a certain treatment than the rest of the patients. We propose and present efficient methods for mining predictive patterns for both atemporal and temporal (time series) data. Our first method relies on frequent pattern mining to explore the search space. It applies a novel evaluation technique for extracting a small set of frequent patterns that are highly predictive and have low redundancy. We show the benefits of this method on several synthetic and public datasets. Our temporal pattern mining method works on complex multivariate temporal data, such as electronic health records, for the event detection task. It first converts time series into time-interval sequences of temporal abstractions and then mines temporal patterns backwards in time, starting from patterns related to the most recent observations. We show the benefits of our temporal pattern mining method on two real-world clinical tasks

    A Temporal Abstraction Framework for Classifying Clinical Temporal Data

    No full text
    The increasing availability of complex temporal clinical records collected today has prompted the development of new methods that extend classical machine learning and data mining approaches to time series data. In this work, we develop a new framework for classifying the patient’s time-series data based on temporal abstractions. The proposed STF-Mine algorithm automatically mines discriminative temporal abstraction patterns from the data and uses them to learn a classification model. We apply our approach to predict HPF4 test orders from electronic patient health records. This test is often prescribed when the patient is at the risk of Heparin induced thrombocytopenia (HIT). Our results demonstrate the benefit of our approach in learning accurate time series classifiers, a key step in the development of intelligent clinical monitoring systems

    A Pattern Mining Approach for Classifying Multivariate Temporal Data

    No full text
    Abstract—We study the problem of learning classification models from complex multivariate temporal data encountered in electronic health record systems. The challenge is to define a good set of features that are able to represent well the temporal aspect of the data. Our method relies on temporal abstractions and temporal pattern mining to extract the classification features. Temporal pattern mining usually returns a large number of temporal patterns, most of which may be irrelevant to the classification task. To address this problem, we present the minimal predictive temporal patterns framework to generate a small set of predictive and non-spurious patterns. We apply our approach to the real-world clinical task of predicting patients who are at risk of developing heparin induced thrombocytopenia. The results demonstrate the benefit of our approach in learning accurate classifiers, which is a key step for developing intelligent clinical monitoring systems. I
    corecore